Morphdb.hu: Hungarian lexical database and morphological grammar

نویسندگان

  • Viktor Trón
  • Péter Halácsy
  • Péter Rebrus
  • András Rung
  • Péter Vajda
  • Eszter Simon
چکیده

This paper describes morphdb.hu, a Hungarian lexical database and morphological grammar. Morphdb.hu is the outcome of a severalyear collaborative effort and represents the resource with the widest coverage and broadest range of applicability presently available for Hungarian. The grammar resource is the formalization of well-founded theoretical decisions handling inflection and productive derivation. The lexical database was created by merging three independent lexical databases, and the resulting resource was further

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Benefits of Resource-Based Stemming in Hungarian Information Retrieval

This paper discusses the impact of resource-driven stemming in information retrieval tasks. We conducted experiments in order to identify the relative benefit of various stemming strategies in a language with highly complex morphology. The results reveal the importance of various aspects of stemming in enhancing system performance in the IR task of the CLEF ad-hoc monolingual Hungarian track. T...

متن کامل

A New Integrated Open-source Morphological Analyzer for Hungarian

The goal of a Hungarian research project has been to create an integrated Hungarian natural language processing framework. This infrastructure includes tools for analyzing Hungarian texts, integrated into a standardized environment. The morphological analyzer is one of the core components of the framework. The goal of this paper is to describe a fast and customizable morphological analyzer and ...

متن کامل

Creation and Development of the Romanian Lexical Resources

This article describes the Romanian lexical resources containing morphological data and dictionaries: synonyms, Romanian-English, and RomanianRussian. The inflection process at the creation of morphological resources based on the functional grammar with scattered context is considered. An arbitrary word is inflected knowing only its part of speech, and the gender for nouns. New words were obtai...

متن کامل

The GeLexi MT Project

Our double Machine (-Aided) Translation project to be demonstrated here (sections 3-4) is based on the parsing project [5-9, 12] of our Pécs theoretical and computational linguistics research team, called GeLexi (‘Generative LEXIcon’), whose (knowledge-based) parsers (section 2) rely on “totally lexicalist” [21, 2, 4, 7] representative mini-grammars of English and the (morphologically very rich...

متن کامل

On Hungarian Morphology

The aim of this study is to provide an autosegmental description of Hungarian morphology. Chapter 1 sketches the (meta)theoretical background and summarizes the main argument. In Chapter 2 phonological prerequisites to morphological analysis are discussed. Special attention is paid to Hungarian vowel harmony. In Chapter 3 a universal theory of lexical categories is proposed, and the category sy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006